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Abstract — We study the exact and optimal repair of multiple 
failures in codes for distributed storage. More particularly, we 
provide an explicit construction of exact minimum bandwidth 
coordinated regenerating codes (MBCR) for n — d + t,k,d > 
k,t > 1. Our construction differs from existing constructions by 
allowing both t > 1 (Le., repair of multiple failures) and d > k 
(i.e., contacting more than k devices during repair). 

I. Introduction 

Codes are useful for tolerating failures in distributed storage 
systems. Yet, erasure correcting codes suffer from huge repair 
costs after failures. Recently, regenerating codes [1] have 
relied on network coding to achieve the optimal tradeoff 
between storage cost and bandwidth (repair cost). Such codes 
have latter been extended to support multiple failures (i.e., 
coordinated or cooperative regenerating codes) [2]-[4]. These 
studies identify two main types of regenerating codes: (i) Min- 
imum Storage (MSR/MSCR) regenerating codes which min- 
imize storage costs in priority and (ii) Minimum Bandwidth 
(MBR/MBCR) which minimize bandiwdth costs in priority. 
Codes achieving these optimal tradeoffs can be built using 
random linear network codes. Yet, such non-deterministic 
schemes are not desirable for they require complex integrity 
checking scheme, and cannot be turned into systematic codes. 

Hence, regenerating codes have been studied with the addi- 
tional constraint of repairing exactly what is lost, in order to 
build deterministic coding schemes. The problem of repairing 
exactly a single failure has been well studied both at the MBR 
point [5] and MSR [5]-[9] point. However, the exact repair of 
multiple failures remains an open question since it has been 
studied only for the very specific setting d = k at both the 
MSCR point [4] and the MBCR point [10]. 

We focus on this problem and propose an explicit code con- 
struction for the case of n = d+t, k, d > k, t > 1 thus relaxing 
the constraint d = k present in previous constructions [10]. In 
the following, we will present our code construction, the repair 
algorithm and the decoding algorithm. 

II. CROSS-MBCR Codes 

Minimum Bandwidth Coordinated Regenerating Codes [2] 
guarantee that a file of M original blocks encoded to net 
blocks stored uniformly on n devices can be recovered from 
any k devices, and that if t devices fail the optimal repair 
procedure consist in downloading f3 blocks from d non-failed 



devices and j3' blocks from the t — 1 other devices being 
repaired. If we set f3 f = 1, 

a = 2d + t-l f3 = 2 M = k{2d-k + t) 

CROSS-MBCR codes are built upon two encoding matrices: 

• , which is the generator matrix of (n, k) MDS code 
(e.g., Cauchy or Vandermonde matrices), 

• \I> , which is the generator of an (n — MDS code, 
such that (Id,d, ^) is also the generator of a systematic 
MDS code (e.g., Vandermonde or Cauchy matrices). 

They support any k, any d > k, any t > 1 and only require 
that n = d+t (i.e., all devices participate to the repair either as 
a device providing data or as a device being repaired). In the 
rest of this section, we will describe the encoding procedure, 
the repair procedure and the decoding procedure, thus defining 
completely the code and showing that it satisfy all needed 
properties. 

A. Encoding 

The encoding procedure is shown on Figure la. Given the 
condition n = d + t, the file size M can be rewritten as 
kn + k(d—k). The file (xi, . . . , xm) is divided in n sequences 
ai, . . . , a n and d — k sequences 6i, . . . , bd-k such that 

a i — (^(i— l)/c+l ? • • • i x ik)i=l...n 

= ( x (j-l)k+l+kni • • • ,%jk+kn)j=l...d-k 

Step 1: The sequence is written on the first k positions 
of the device i. Each sequence bj is encoded using the 
generator \I>, and the i th resulting block is stored at the k + j 
position of device i. Let us name wi = (a^ ^ \B) the sequence 
of blocks stored on the d first positions of device i, where B 
is the (k x d — k) matrix with bj as column vectors. These d 
blocks are designated as primary data blocks. 

Step 2: The primary data blocks Wi stored on device 
i are encoded using and the resulting n — 1 blocks are 
stored on all other devices. Each device stores a total of n — 1 
blocks pi = ($\w i(B1 & 2 w im ... ^_i^i n-i) where 
© is the addition modulo n. These blocks are designated as 
secondary data blocks. The technique used for building these 
secondary block is similar to the one used in [10]. 

B. Repairing 

The repair procedure is illustrated on Figure 2. 
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Figure 1. Code for a file stored to n = 6 devices so that any k = 2 devices can recover the file, and t = 2 devices can repair from d = 4 devices 



Ste/? 7: Each device being repaired i fetches the sec- 
ondary data blocks <$>Wi still available from live devices. Since 
<1> is the encoding matrix of an MDS code, the device being 
repaired can decode and recover the primary data blocks Wi. 

Step 2: The missing secondary data blocks, initially 
stored on failed devices, are re-generated from primary data 
blocks as done in the second step of the encoding. 
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Figure 2. Repairing a code (n = 6, k = 2, d = 4, t = 2) 
C. Decoding 

The decoding procedure is shown on Figure lb and use both 
the primary data and the secondary data. 

Step 1: The primary data is retrieved from the k contacted 
devices. The encoded part is decoded using since \£ is the 
generator matrix of an (n, k) MDS code. A total of k(k+d—k) 
original blocks are recovered during this step. 

Step 2: The part decoded during step 1 is encoded again 
using ^ to recreate the encoded part of the primary data blocks 
stored on the n — k non contacted devices. 

Step 3: The secondary data blocks retrieved from the 
k contacted devices and the encoded primary data blocks 
recreated at step 2 are processed together and decoded using 
<E> -1 to recover the remaining k(n — k) original blocks. This 
is possible since {Id,d 3>) is the generator matrix of an 



(n — 1 + d : d) systematic MDS code. As a consequence, the 
file can be fully recovered from any k devices. 

III. Conclusion 

In this paper, we define an explicit exact MBCR codes 
construction, thus showing that it is possible to build exact 
minimum bandwidth coordinated regenerating codes for a 
wide set of parameters (n = d + t : k,d,t). It is interesting 
to notice that if d — k, the sequences bj encoded using ^ 
disappear, and the whole scheme degenerates into the same 
coding scheme as initially proposed for d — k by Shum et 
al. [4]. As a consequence, our scheme encompass the single 
previous exact MBCR codes construction as a special case. 
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Context 

•Regenerating codes reduce the repair cost due to a failure in distributed 
storage systems 

•Coordinated regenerating codes support multiple failures 
•Codes can be built using random codes 
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Motivation 

•Exact codes offer increased security and usability 
•The case of single failures has been well studied 
•The case of multiple failures is open except for d=k 



Encoding and storing a file 

•Split the file in M=k(2d-k+t) = kn + k(d-k) blocks 
•Store the first kn blocks (kon each devices) 
•Encode the d-k groups of k blocks to d-k groups of n blocks 
using an MDS encoding ^ and store 



•For each device, encode the d primary blocks to n-1 

secondary blocks using an "MDS" encoding O 

•Store these blocks (one on each of the n-1 other devices) 
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Recovering the file 

•When accessing any k devices, the file can be 
recovered by fetching the primary data from these k 
devices, decoding using and using the recovered 
data to decode the secondary data to recover 
additional primary data 




Conclusion 

•Exact MBCR codes exist for any values n=d+t f k f d f t 
•CROSS MBCR is a way to construct such codes 



Repairing failed devices 

•Recover the primary data from secondary data stored on 
non-failed devices by decoding with (D 1 
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•Recover the secondary data by encoding again using O 




